Abstract

We present here an optimized and parallelized version of the augmented space
recursion code for the calculation of the electronic and magnetic properties of
bulk disordered alloys, surfaces and interfaces (whether flat, corrugated or
rough) and random networks. The code has been benchmarked on bulk disordered
alloys.



1 Introduction 

The aim of this communication is to present and describe a computational
package to handle first-principles density functional theory (DFT) based studies
of the electronic structure of systems without long-ranged lattice translational
symmetry. Bulk disordered alloys, and surfaces and interfaces which are flat,
corrugated or rough, fall under this category, and our formalism should be able
to take care of them. The aim is also to go beyond the usual mean-field
approaches like the coherent potential approximation (CPA) and to take into
account configuration fluctuations of the local environment. Lack of lattice
translational symmetry means that the standard reciprocal space techniques
based on the powerful Bloch theorem are no longer applicable, and we must
depend on alternative techniques based purely on real space approaches. Our
formalism is a marriage of three distinct methods which have individually been
applied extensively: namely, the recursion method (RM) of Haydock et al.
[1]-[2], the augmented space method proposed by one of us [3]-[4] (ASR) and the
tight-binding linear muffin-tin orbitals method (TB-LMTO) [5]. The last
mentioned provides us with a DFT self-consistent sparse representation of the
Hamiltonian in a real-space minimal basis $\{|R_n L\rangle\}$ which spans the
Hilbert space $\mathcal{H}$. Here $R_n$ labels the sites where the ion-cores sit
and $L = \{\ell m \sigma\}$ are the angular momentum indices. For a disordered
system the matrix elements of the Hamiltonian representation in this basis are
random. This representation is then taken over by the ASR to generate a
modified Hamiltonian representation in the outer product space of $\mathcal{H}$
and the space $\mathcal{C}$ of configuration fluctuations of the random
parameters. This modified Hamiltonian in the augmented space
$\mathcal{H} \otimes \mathcal{C}$ represents the collection of Hamiltonians for
all possible configurations of the random parameters. Once this is done, the RM
allows us to obtain the matrix elements of the Green functions related to the
Kohn-Sham equation for the electronic states. The augmented space theorem (AST)
[6] relates a specific matrix element of the Green function in the augmented
space $\mathcal{H} \otimes \mathcal{C}$ to the configuration averaged Green
function. The RM is a purely real-space based technique, and is therefore as
applicable to a bulk system as to one with surfaces, interfaces or extended
defects. In the following we shall describe each of the points raised above in
some detail.

2 Tight-binding linear muffin-tin orbitals method 

The TB-LMTO has been described in great detail earlier [5], [7]-[11]. We shall
only quote here the main results which will be relevant for setting up the ASR
programme. The starting point is the Kohn-Sham equation with the muffin-tin
effective crystal potential. The basis chosen for the representation of the
wave-function is the set of muffin-tin orbitals $\{|R_n L\rangle\}$. If we
expand the wave-function as a linear combination of these muffin-tin orbitals
and substitute the expansion in the Kohn-Sham equation, we obtain the
Korringa-Kohn-Rostoker (KKR) secular equation:

$$\det \left\| P_{R_n L}(E)\,\delta_{R_n R_m}\,\delta_{LL'} - S_{R_n L, R_m L'}(\kappa) \right\| = 0$$

Energy linearization gives the eigen-type LMTO secular equation:

$$\det \left\| E\,\delta_{R_n R_m}\,\delta_{LL'} - H^{(2)}_{R_n L, R_m L'} \right\| = 0 \qquad (1)$$

where the 'second order' Hamiltonian is given by:

$$H^{(2)} = E_\nu + h - h\,o\,h \qquad (2)$$

Note that each element is a matrix in the L space. In the screened
representation the structure matrix $\underline{S}$ is short-ranged and the
Hamiltonian is sparse:

$$h_{R_n L, R_m L'} = \left(C_{R_n L} - E_{\nu R_n L}\right)\delta_{R_n R_m}\,\delta_{LL'} + \Delta^{1/2}_{R_n L}\, S_{R_n L, R_m L'}\, \Delta^{1/2}_{R_m L'} \qquad (3)$$

The expansion energies $E_{\nu\ell}$ are chosen by us at the center of the
energy window of interest, and the potential parameters $C$, $\Delta$ and $o$
are diagonal in L space and are self-consistently generated. The structure
matrix $\underline{S}$ is obtained from the geometry of the lattice. Note that
the TB-LMTO basis is minimal, and hence it is usually enough to take
$\ell \le 3$; in a majority of cases one does not have to go beyond $\ell = 2$.

At this point we should comment on several possible generalizations. We have
energy linearized the secular equation. In case we do not wish to do so, we can
still deal with an energy dependent "Hamiltonian" or "secular matrix"
$\underline{P}(E) - \underline{S}(\kappa)$. This is the TB-KKR. The subsequent
recursion becomes energy-dependent; however, the recursions at different energy
points are independent and can be carried out in parallel for efficiency. Such
energy dependent recursion has been carried out by us earlier [10]. The
assumption (justified a posteriori by the calculations) was that the recursion
coefficients are weakly energy dependent; the recursion is therefore carried
out at equi-spaced "seed" points and the intermediate points are found by
interpolation. Further, if we allow for third-nearest neighbour sparsity in the
Hamiltonian, we could start with the real-space full-potential LMTO (RS-FPLMTO)
of Eriksson et al. [11].

3 The Recursion method 

While the above formulation is general enough, we should stress the fact that
the Hamiltonian representation is an infinite matrix. Lack of lattice
translation symmetry means that we are unable to symmetry reduce the
dimensionality of the matrix to essentially $2\ell_{\max} + 1$. We therefore
have to resort to techniques which allow us to deal with infinite matrices. The
Recursion method is exactly such a technique, introduced by Haydock et al.
[1]-[2] to obtain the Green functions associated with the secular equation.
This gives us the density of states, from which we may obtain properties like
the Fermi energy, the charge and magnetization densities, the magnetic moment
and the band energy. A recent generalization also gives us correlation
functions associated with response functions [12]-[13].

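The RM begins by recursively changing the basis through a three term
recurrence relation:

$$|1\rangle = |R_i L\rangle, \qquad |2\rangle = H^{(2)}|1\rangle - \alpha_1 |1\rangle,$$
$$|n+1\rangle = H^{(2)}|n\rangle - \alpha_n |n\rangle - \beta_n^2\, |n-1\rangle \quad \text{for } n > 1 \qquad (4)$$

Mutual orthogonality of this new basis gives:

$$\alpha_n = \frac{\langle n|H^{(2)}|n\rangle}{\langle n|n\rangle} \quad \text{and} \quad \beta_{n+1}^2 = \frac{\langle n+1|n+1\rangle}{\langle n|n\rangle} \qquad (5)$$

To understand how the above equations are efficiently coded on the computer,
let us first describe the Hamiltonian as an operator $H^{(2)}$. From Eqn. (2):

$$H^{(2)} = E_\nu + h - h\,o\,h \quad \text{and} \quad h = (C - E_\nu) + \Delta^{1/2}\, S\, \Delta^{1/2} \qquad (6)$$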



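Of these operators four are 'diagonal': $E_\nu$, $C$, $\Delta^{1/2}$ and $o$.
Their structures are all of the form
$D = \sum_i D_i\, |R_i\rangle\langle R_i| = \sum_i D_i\, P_i$, while the
structure matrix is off-diagonal:
$S = \sum_{\langle ij \rangle} S_{R_i R_j} \left( |R_i\rangle\langle R_j| + |R_j\rangle\langle R_i| \right) = \sum_{\langle ij \rangle} S_{R_i R_j}\, T_{ij}$.
We represent a Hilbert space 'vector' $|n\rangle$ by a matrix whose i-th row
$\underline{n}_i$ holds the components on site $R_i$ for all $L$:

$$|n\rangle = \begin{pmatrix} \underline{n}_1 \\ \underline{n}_2 \\ \vdots \\ \underline{n}_p \\ \vdots \end{pmatrix}
= \begin{pmatrix} n_{11} & n_{12} & \dots & n_{1L} \\ n_{21} & n_{22} & \dots & n_{2L} \\ \vdots & \vdots & & \vdots \\ n_{p1} & n_{p2} & \dots & n_{pL} \\ \vdots & \vdots & & \vdots \end{pmatrix}$$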

The action of the diagonal operators D is rather simple:

(i) One by one choose the non-zero rows $\underline{n}_i$ of $|n\rangle$.

(ii) Multiply $D_i\, \underline{n}_i$ and add this to the i-th row of a new $|n'\rangle$.

(iii) Repeat this for all non-zero rows of $|n\rangle$. Finally $|n'\rangle = D|n\rangle$.
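The three steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the package's routine: the vector is stored as a dictionary holding just its non-zero rows, and the names `apply_diagonal`, `D` and `n` are hypothetical.

```python
def apply_diagonal(D, n):
    """Return |n'> = D|n> for a site-diagonal operator.

    D maps a site index to its L x L block (list of lists).
    n maps a site index to a length-L row; absent sites are zero rows.
    Both containers are assumptions made for this sketch.
    """
    out = {}
    for i, row in n.items():                      # (i) visit the non-zero rows only
        block = D[i]
        new_row = [sum(block[a][b] * row[b] for b in range(len(row)))
                   for a in range(len(block))]    # (ii) D_i acting on row i
        prev = out.get(i, [0.0] * len(new_row))
        out[i] = [p + q for p, q in zip(prev, new_row)]   # accumulate into |n'>
    return out                                    # (iii) all rows done


# Two sites with two orbitals each; only site 0 is occupied in |n>.
D = {0: [[2.0, 0.0], [0.0, 3.0]], 1: [[1.0, 1.0], [0.0, 1.0]]}
result = apply_diagonal(D, {0: [1.0, 1.0]})
```

Storing only the non-zero rows keeps the cost proportional to the number of sites the recursion has reached so far, rather than to the size of the full lattice.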

The action of the off-diagonal operator S is more complicated, as it has the
information of the underlying lattice or network embedded in it. The first step
is to prepare the "neighbour map" (NM). The NM is a matrix
$\underline{N} = \{N_{ij}\}$ where $N_{ij}$ is the j-th neighbour of the i-th
site on the lattice or network. We first have to number the lattice/network
points by integers. The near neighbour vectors point in different directions,
and while numbering, this directionality has to be carefully recorded, since
the structure matrix between $R_n$ and $R_m$ depends upon the direction of
$R_n - R_m$. In Figure 1 we show the geometry of a bulk square lattice, a
square lattice with a plane surface (100), one with a corrugated surface (110)
and a network with four local bonds.

Since the integer numbering always increases clockwise from a site, the
directionality is preserved. The neighbour maps, also preserving directionality,
for the first three examples are :

Figure 1: The geometry of (top, left) a bulk square lattice, (bottom, left) a
square lattice with a plane (100) surface, (top, right) a square lattice with a
corrugated (110) surface and (bottom, right) a fourfold coordinated planar
network.

The first four columns give the neighbours in the four directions and the last
column gives the number of neighbours. For the fourth example, a fourfold
coordinated planar random network, the neighbour map is the same as for the
square lattice; however, the structure matrix S is different.
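As an illustration of such a map, the sketch below builds the neighbour map of a small square lattice with four neighbour columns and a final count column. The site numbering (row by row, starting from 1), the clockwise direction order starting from +x, the use of 0 for a missing neighbour and the function name `neighbour_map` are all assumptions of this sketch; with `surface=True` both edges in the y direction are left free, so it describes a slab rather than a semi-infinite system.

```python
def neighbour_map(nx, ny, surface=False):
    """Neighbour map of an nx x ny square lattice (periodic along x).

    Each row holds the four neighbours of a site, in a fixed clockwise
    direction order, followed by the number of neighbours.
    """
    directions = [(1, 0), (0, -1), (-1, 0), (0, 1)]   # +x, -y, -x, +y

    def label(ix, iy):
        return iy * nx + ix + 1                       # integer site label

    nm = []
    for iy in range(ny):
        for ix in range(nx):
            row = []
            for dx, dy in directions:
                jx = (ix + dx) % nx
                jy = iy + dy
                if surface and not (0 <= jy < ny):
                    row.append(0)                     # bond cut by the surface
                else:
                    row.append(label(jx, jy % ny))
            row.append(sum(1 for m in row if m != 0)) # last column: count
            nm.append(row)
    return nm


bulk = neighbour_map(3, 3)
slab = neighbour_map(3, 3, surface=True)
```

Because the column index fixes the direction, the same column can later be used to pick the direction-dependent block of the structure matrix.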

The action of S on a vector $|n\rangle$ is guided by the relevant neighbour map.
The operations proceed as follows:

(i) One by one choose the rows $\underline{n}_i$ of $|n\rangle$.

(ii) From the neighbour map choose $m = N_{ik}$ (which is the k-th neighbour of i).

(iii) Multiply $S_{mi}\, \underline{n}_i$ and add this to the m-th row of a new $|n'\rangle$.

(iv) Repeat this for all the non-zero rows of $|n\rangle$. Finally $|n'\rangle = S|n\rangle$.
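The steps above can be sketched as follows. The example is a hedged illustration only: it assumes a translationally invariant lattice so that the structure-matrix block depends only on the direction index k, it uses a hand-written neighbour map of a four-site ring (columns: right neighbour, left neighbour, count), and the names `apply_structure`, `nm` and `S_dir` are hypothetical.

```python
def apply_structure(nm, S_dir, n):
    """Return |n'> = S|n> using a neighbour map.

    nm[i-1] lists the neighbours of site i (0 means no neighbour),
    followed by the number of neighbours in the last column.
    S_dir[k] is the L x L block for the k-th direction (assumed to be
    the same for every site). n maps site -> length-L row.
    """
    out = {}
    for i, row in n.items():                          # (i) rows of |n>
        for k, m in enumerate(nm[i - 1][:-1]):        # (ii) m = N_ik
            if m == 0:
                continue                              # bond missing (surface)
            block = S_dir[k]
            contrib = [sum(block[a][b] * row[b] for b in range(len(row)))
                       for a in range(len(block))]    # (iii) S_mi n_i
            prev = out.get(m, [0.0] * len(contrib))
            out[m] = [p + q for p, q in zip(prev, contrib)]
    return out                                        # (iv) |n'> = S|n>


# Four-site ring, one orbital per site, hopping 0.5 in both directions.
nm = [[2, 4, 2], [3, 1, 2], [4, 2, 2], [1, 3, 2]]
S_dir = [[[0.5]], [[0.5]]]
once = apply_structure(nm, S_dir, {1: [1.0]})
twice = apply_structure(nm, S_dir, once)
```

Each application of S spreads the vector by one shell of neighbours, which is why the number of stored rows grows with the recursion step.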

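The recursive operations described in Eqns. (4) and (5) can be encoded into the
following modules:

A. Module HOP($|\Phi_2\rangle$; $|\Phi_1\rangle$), which describes the action of
the Hamiltonian on a 'vector'.

The input into this module is: $|n\rangle \Rightarrow |\Phi_2\rangle$

$\Delta^{1/2}|\Phi_2\rangle \Rightarrow |\Psi_1\rangle$ ; $S|\Psi_1\rangle \Rightarrow |\Psi_2\rangle$ ; $\Delta^{1/2}|\Psi_2\rangle \Rightarrow |\Psi_1\rangle$

$(C - E_\nu)|\Phi_2\rangle \Rightarrow |\Psi_2\rangle$ ; $|\Phi_1\rangle = |\Psi_2\rangle + |\Psi_1\rangle = h|n\rangle$

$o|\Phi_1\rangle \Rightarrow |\Psi_3\rangle = o\,h|n\rangle$

$\Delta^{1/2}|\Psi_3\rangle \Rightarrow |\Psi_1\rangle$ ; $S|\Psi_1\rangle \Rightarrow |\Psi_2\rangle$ ; $\Delta^{1/2}|\Psi_2\rangle \Rightarrow |\Psi_1\rangle$

$(C - E_\nu)|\Psi_3\rangle \Rightarrow |\Psi_2\rangle$ ; $|\Phi_1\rangle = |\Phi_1\rangle - |\Psi_2\rangle - |\Psi_1\rangle = [h - h\,o\,h]\,|n\rangle$

$E_\nu|\Phi_2\rangle \Rightarrow |\Psi_1\rangle$ ; $|\Phi_1\rangle = |\Phi_1\rangle + |\Psi_1\rangle$

The output from this module is: $|\Phi_1\rangle = H^{(2)}|n\rangle$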
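The sequence of operations in Module HOP can be mimicked by the short sketch below. It is a toy version under simplifying assumptions: one orbital per site, so that the potential parameters are per-site numbers, and a small dense structure matrix instead of the neighbour-map based sparse product. The function names `matvec` and `hop` are hypothetical.

```python
def matvec(A, v):
    """Dense matrix-vector product, standing in for the sparse S|n>."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def hop(C, Enu, sqrt_delta, o, S, n):
    """Return H2|n> with H2 = E_nu + h - h o h and
    h = (C - E_nu) + Delta^(1/2) S Delta^(1/2)."""

    def h(v):
        psi1 = [d * x for d, x in zip(sqrt_delta, v)]         # Delta^(1/2)|v>
        psi2 = matvec(S, psi1)                                # S|psi1>
        psi1 = [d * x for d, x in zip(sqrt_delta, psi2)]      # Delta^(1/2)|psi2>
        psi2 = [(c - e) * x for c, e, x in zip(C, Enu, v)]    # (C - E_nu)|v>
        return [a + b for a, b in zip(psi2, psi1)]

    phi1 = h(n)                                   # h|n>
    psi3 = [oi * x for oi, x in zip(o, phi1)]     # o h|n>
    hoh = h(psi3)                                 # h o h|n>
    return [e * x + a - b for e, x, a, b in zip(Enu, n, phi1, hoh)]


# Two-site toy model with arbitrary illustrative parameters.
C, Enu = [0.5, 0.7], [0.1, 0.2]
sqrt_delta, o = [1.0, 2.0], [0.3, 0.4]
S = [[0.0, 1.0], [1.0, 0.0]]
column0 = hop(C, Enu, sqrt_delta, o, S, [1.0, 0.0])
```

The point of the operator form is that $H^{(2)}$ is never assembled as a matrix: only products of sparse pieces with a vector are ever needed.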





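B. The Module REC($|\Phi_2\rangle$, $|\Phi_3\rangle$, $\beta_n^2$; $\alpha_n$, $\beta_{n+1}^2$), which
calculates the coefficients $\alpha_n$, $\beta_{n+1}^2$ recursively:

The input into this module is: $|\Phi_2\rangle = |n\rangle$ ; $|\Phi_3\rangle = |n-1\rangle$ and $\beta_n^2$

Run HOP($|\Phi_2\rangle$; $|\Phi_1\rangle$)

$\alpha_n = \langle\Phi_2|\Phi_1\rangle / \langle\Phi_2|\Phi_2\rangle$

$|\Phi_1\rangle = |\Phi_1\rangle - \alpha_n|\Phi_2\rangle - \beta_n^2|\Phi_3\rangle$ ; $\beta_{n+1}^2 = \langle\Phi_1|\Phi_1\rangle / \langle\Phi_2|\Phi_2\rangle$

$|\Phi_3\rangle = |\Phi_2\rangle$ and $|\Phi_2\rangle = |\Phi_1\rangle$

The output from this module is: $|\Phi_2\rangle = |n+1\rangle$ ; $|\Phi_3\rangle = |n\rangle$ and $\alpha_n$, $\beta_{n+1}^2$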



The two vectors $|\Phi_1\rangle$ and $|\Phi_2\rangle$ are the input/output vectors
of the Module HOP, while the three vectors $|\Psi_1\rangle$, $|\Psi_2\rangle$ and
$|\Psi_3\rangle$ are dummies which are needed only within the module. The space
for these dummies may be dynamically allocated during this procedure and
released after the operation is over. The same is true for the vector in the
Module REC.
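A minimal sketch of the loop over Module REC is given below, assuming the Hamiltonian is supplied as a function that returns $H|n\rangle$. The test Hamiltonian is a nearest-neighbour chain with unit hopping, chosen here only because its coefficients are known exactly ($\alpha_n = 0$, $\beta_n^2 = 1$ when starting from the end site); the names `recursion` and `chain_hop` are hypothetical.

```python
def recursion(apply_h, start, n_steps):
    """Three-term recursion with un-normalized vectors.

    Returns (alphas, betas2) where alphas[k] = alpha_{k+1} and
    betas2[k] = beta_{k+2}^2, mirroring the (alpha_n, beta_{n+1}^2)
    output of one REC call.
    """
    dot = lambda u, v: sum(a * b for a, b in zip(u, v))
    phi2 = list(start)                  # |n>
    phi3 = [0.0] * len(start)           # |n-1>, with |0> the null vector
    beta2 = 1.0                         # beta_1^2, multiplies the null vector
    alphas, betas2 = [], []
    for _ in range(n_steps):
        phi1 = apply_h(phi2)                              # HOP
        norm = dot(phi2, phi2)
        alpha = dot(phi2, phi1) / norm
        phi1 = [a - alpha * b - beta2 * c
                for a, b, c in zip(phi1, phi2, phi3)]     # |n+1>
        beta2 = dot(phi1, phi1) / norm
        alphas.append(alpha)
        betas2.append(beta2)
        phi3, phi2 = phi2, phi1                           # shift the vectors
    return alphas, betas2


def chain_hop(v):
    """Open chain with unit nearest-neighbour hopping and zero on-site energy."""
    size = len(v)
    return [(v[i - 1] if i > 0 else 0.0) + (v[i + 1] if i < size - 1 else 0.0)
            for i in range(size)]


start = [1.0] + [0.0] * 9               # start on the end site of a 10-site chain
alphas, betas2 = recursion(chain_hop, start, 5)
```

Only three vectors are alive at any time, which is what makes the method practical on very large real-space or augmented-space maps.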



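Eqn. (4) indicates that in the new basis the Hamiltonian representation is
tri-diagonal. It follows immediately that the Green function for the system is
given by a continued fraction:

$$G_{R_i L, R_i L}(z) = \cfrac{1}{z - \alpha_1 - \cfrac{\beta_2^2}{z - \alpha_2 - \cfrac{\beta_3^2}{\;\ddots\; \cfrac{\beta_{n_0}^2}{z - \alpha_{n_0} - \beta_{n_0+1}^2\, T(z)}}}} \qquad (7)$$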



The asymptotic part of the Green function, $T(z)$, is called the Terminator.
Many terminators are available in the literature. The suitable one depends upon
the way in which the coefficients $\{\alpha_n, \beta_n^2\}$ behave as
$n \to \infty$. The most commonly used is the square root terminator, which is
suitable when $\{\alpha_n, \beta_n^2\} \to \{\alpha, \beta^2\}$. Luchini and Nex
[14] have suggested a modification for when calculations up to a large $n_0$
are not possible. If we carry out the terminator approximation after $n_0$
steps, the first $2n_0$ moments of the density of states are exact, and it has
been shown that the asymptotic moments are also accurately reproduced. Beer and
Pettifor [15] have suggested an alternative way of obtaining the terminator.
They note that $T(z) = T(z, \{\alpha_n, \beta_n\}, n < n_0)$, so that the
Beer-Pettifor prescription is a closed algorithm. It wins over the square-root
terminator described above when the convergence of the coefficients is either
oscillatory or slow. Viswanathan and Muller [16] have suggested several other
terminators, like the terminator with an exponential tail and the terminator
with a Gaussian tail, each suited to a particular asymptotic growth of the
coefficients with $n$. We shall have options for several terminating procedures
so that the user may choose according to need.

C. The Module GREEN($L_{\max}$, $n_0$, $E_{\min}$, $E_{\max}$), which calculates
the Green function.



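Loop ($L = 1, L_{\max}$)
  Input $\Rightarrow$ $|\Phi_2\rangle = |R_i L\rangle$, $|\Phi_3\rangle = |0\rangle$ ; $\beta_1^2 = 1$
  Loop ($n = 1, n_0$)
    Run $\Rightarrow$ REC($|\Phi_2\rangle$, $|\Phi_3\rangle$, $\beta_n^2$; $\alpha_n$, $\beta_{n+1}^2$)
  End Loop
End Loop

Loop ($L = 1, L_{\max}$)
  Loop ($E = E_{\min}, E_{\max}$)
    Run $\Rightarrow$ TERM($\{\alpha_n, \beta_n^2\}$, $n_0$, $E$, $G$)
    Loop ($n = n_0, 1, -1$)
      $G = 1 / (E - \alpha_n - \beta_{n+1}^2\, G)$
    End Loop
  End Loop $\Rightarrow$ Output $= G = G_{R_i L, R_i L}(E)$
End Loop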



The partial density of states, projected onto a particular $(R_i L)$, is:

$$n_{R_i L}(E) = -\frac{1}{\pi} \lim_{\delta \to 0} \Im m\; G_{R_i L, R_i L}(E + i\delta) \qquad (8)$$
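The backward evaluation of the continued fraction and the projected density of states can be sketched as below. The sketch uses the square root terminator, obtained here by solving $T = 1/(z - \alpha - \beta^2 T)$ for constant asymptotic coefficients and picking the root with $\Im m\, T \le 0$; it is not the Beer-Pettifor scheme used in the benchmarks, and the names `sqrt_terminator`, `green` and `pdos` as well as the finite broadening `delta` are assumptions of the sketch.

```python
import cmath
import math


def sqrt_terminator(z, a, b2):
    """Square-root terminator for constant coefficients (a, b2)."""
    root = cmath.sqrt((z - a) ** 2 - 4.0 * b2)
    t = (z - a - root) / (2.0 * b2)
    if t.imag > 0:                       # keep the retarded (Im T <= 0) branch
        t = (z - a + root) / (2.0 * b2)
    return t


def green(z, alphas, betas2, a_inf, b2_inf):
    """Continued fraction, evaluated from level n0 back to level 1.

    alphas[k] = alpha_{k+1}, betas2[k] = beta_{k+2}^2.
    """
    g = sqrt_terminator(z, a_inf, b2_inf)
    for a, b2 in zip(reversed(alphas), reversed(betas2)):
        g = 1.0 / (z - a - b2 * g)
    return g


def pdos(E, alphas, betas2, a_inf, b2_inf, delta=1e-6):
    """Projected density of states, -Im G(E + i delta) / pi."""
    return -green(complex(E, delta), alphas, betas2, a_inf, b2_inf).imag / math.pi


# Constant coefficients give the semicircular density sqrt(4 - E^2) / (2 pi).
alphas, betas2 = [0.0] * 5, [1.0] * 5
centre = pdos(0.0, alphas, betas2, 0.0, 1.0)
```

For this constant-coefficient case the result is independent of how many exact levels are kept, which makes it a convenient check of the branch choice in the terminator.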



It is clear from this discussion that the recursion procedure is general enough 
to deal with lattices of any complexity, surfaces and interfaces and disordered 
networks. 

4 The augmented space formalism 

Finally we come to the main part of the package: that which deals with
disorder. The augmented space formalism [5] allows us to directly compute the
configuration average of the Green function by constructing a Hamiltonian in
the augmented space of configurations of the random parameters. The formalism
is based on ideas prevalent in quantum measurement theory: we associate with
each random parameter $n_i$ an operator $N_i$ whose spectrum consists of the
measured values of the parameter. The configuration states of $n_i$ are the
eigenkets $\{|n_i\rangle\}$ of $N_i$, which is therefore an operator in the
space $\mathcal{C}_i$ spanned by the different configuration states. Given
this, the probability density of the parameter is

$$p(n_i) = -\frac{1}{\pi}\, \Im m\, \langle \emptyset_i | \left( (n_i + i0)\, I - N_i \right)^{-1} | \emptyset_i \rangle$$

where $|\emptyset_i\rangle$ is a specific reference state in $\mathcal{C}_i$,
fixed by the probability density.

The augmented space theorem [6] then gives us the configuration average of
any function of the random variables:

$$\ll \Phi(\{n_i\}) \gg \;=\; \langle \emptyset |\, \tilde{\Phi}(\{N_i\})\, | \emptyset \rangle \qquad (9)$$

Here,

(i) $|\emptyset\rangle = \prod^{\otimes}_i |\emptyset_i\rangle$. It is a member of the basis set
$\{|n_1 n_2 \ldots\rangle\}$ which spans the configuration space $\Psi$ of the
set of random parameters $\{n_i\}$.

(ii) $\tilde{\Phi}(\{N_i\})$ is an operator in the configuration space
$\Psi = \prod^{\otimes}_i \mathcal{C}_i$. It is the same operator functional of
$\{N_i\}$ as $\Phi(\{n_i\})$ was a function of $\{n_i\}$.

(iii) The expression is exact. The configuration average is done exactly first;
approximations may then be carried out while maintaining various physical
constraints. The philosophy is very different from that of mean-field
approximations like the CPA.



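If we apply this to the configuration averaged Green function of a random
Hamiltonian:

$$\ll G_{R_i L, R_i L}(z, \{n_i\}) \gg \;=\; \langle R_i L \otimes \emptyset |\, \left( z \tilde{I} - \tilde{H}^{(2)}(\{N_i\}) \right)^{-1} | R_i L \otimes \emptyset \rangle \qquad (10)$$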

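Let us take as an example the case of a disordered binary alloy: the
Hamiltonian has the same form as Eqn. (6), but:

$$\tilde{C}_i = \tilde{C}_A\, n_i + \tilde{C}_B\, (1 - n_i) \quad \text{where} \quad \tilde{C}_i = C_i - E_{\nu i},$$
$$\Delta_i^{1/2} = \Delta_A^{1/2}\, n_i + \Delta_B^{1/2}\, (1 - n_i) \quad \text{and} \quad o_i = o_A\, n_i + o_B\, (1 - n_i)$$

The random "occupation" variables $n_i$ take the values 1 and 0 with
probabilities given by the concentrations $x$ and $y$ of the constituents A and
B, respectively.

$N_i$ has a 2 × 2 representation:

$$N_i = \begin{pmatrix} x & \sqrt{xy} \\ \sqrt{xy} & y \end{pmatrix}$$

If we denote the basis of the above representation by $|\uparrow_i\rangle$,
$|\downarrow_i\rangle$, then

$$N_i = x\, P_{\uparrow_i} + y\, P_{\downarrow_i} + \sqrt{xy}\; T_{\uparrow\downarrow_i}$$

and each of the potential parameters becomes an operator in the configuration
space, for example:

$$\tilde{C} = \sum_i \left\{ \ll \tilde{C} \gg \tilde{I} + B(\tilde{C})\, P_{\downarrow_i} + F(\tilde{C})\, T_{\uparrow\downarrow_i} \right\} \otimes P_{R_i}$$
$$\Delta^{1/2} = \sum_i \left\{ \ll \Delta^{1/2} \gg \tilde{I} + B(\Delta^{1/2})\, P_{\downarrow_i} + F(\Delta^{1/2})\, T_{\uparrow\downarrow_i} \right\} \otimes P_{R_i}$$
$$o = \sum_i \left\{ \ll o \gg \tilde{I} + B(o)\, P_{\downarrow_i} + F(o)\, T_{\uparrow\downarrow_i} \right\} \otimes P_{R_i} \qquad (11)$$

where $B(C) = (y - x)(C_A - C_B)$ and $F(C) = \sqrt{xy}\,(C_A - C_B)$. Also, since
the structure matrix is not random:

$$S = \sum_{ij} S_{R_i R_j}\; \tilde{I} \otimes T_{ij} \qquad (12)$$

$$H^{(2)} = E_\nu + h - h\,o\,h \quad \text{where} \quad h = \tilde{C} + \Delta^{1/2}\, S\, \Delta^{1/2} \qquad (13)$$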
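For a single site the construction can be checked directly. The sketch below assumes the 2 × 2 representation of $N_i$ with diagonal elements $x$, $y$ and off-diagonal elements $\sqrt{xy}$, with the reference state taken as the first basis vector; it verifies that the operator built from a potential parameter has the average, $B$ and $F$ entries quoted in Eqn. (11), and that the reference-state element of the resolvent of $N_i$ is $x/(z-1) + y/z$. The function names are hypothetical.

```python
import math


def occupation_operator(x):
    """2 x 2 representation of N_i for concentration x (y = 1 - x)."""
    y = 1.0 - x
    s = math.sqrt(x * y)
    return [[x, s], [s, y]]


def parameter_operator(c_a, c_b, x):
    """Operator C_A N + C_B (I - N) in the single-site configuration space."""
    N = occupation_operator(x)
    I = [[1.0, 0.0], [0.0, 1.0]]
    return [[c_a * N[i][j] + c_b * (I[i][j] - N[i][j]) for j in range(2)]
            for i in range(2)]


def averaged_resolvent(z, x):
    """Reference-state element of (z - N)^(-1), from the 2 x 2 inverse."""
    N = occupation_operator(x)
    a, b, c, d = z - N[0][0], -N[0][1], -N[1][0], z - N[1][1]
    return d / (a * d - b * c)


x, c_a, c_b = 0.3, 2.0, 1.0
y = 1.0 - x
C = parameter_operator(c_a, c_b, x)
average = x * c_a + y * c_b              # <<C>>
B = (y - x) * (c_a - c_b)                # coefficient of the projector
F = math.sqrt(x * y) * (c_a - c_b)       # coefficient of the transfer term
```

The poles of the resolvent at 1 and 0, with weights $x$ and $y$, are exactly the binary probability density of the occupation variable.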

If we now compare Eqns. (10) and (13) with Eqns. (6)-(7), it becomes clear
that once we have defined the Hamiltonian in the space of configurations, the
recursion method may be directly used to obtain the configuration averaged
Green function. The only approximation involved is then the "termination"
approximation. Heine's "Black-body theorem" [17] indicates that most electronic
structure energetics is dominated by the immediate environment of a site. This
is a major justification of the termination approximation. Unlike the CPA,
which gives only the first eight moments of the density of states exactly and
the asymptotic moments accurately, the augmented space recursion gives $2n_0$
moments exactly (where the termination is done after $n_0$ steps) and the
asymptotic moments also accurately. In most of our calculations we can take
$n_0$ to be about 8-9 steps.

5 The TB-LMTO-ASR Algorithm 

The full package is divided into several modules. The first is the Preparation
Module. This module has two branches, one for each constituent of the alloy.
The two branches run in parallel on two different slave processors. The
structure matrix is prepared in the master processor. This Module carries out
the following steps:

(i) It prepares the two control files for the different alloy constituents. This
part is interactive with the user.

(ii) It carries out the simple Hartree calculation and prepares the atomic radii
of the constituents and the initial charge density.

(iii) It calculates the overlap between atomic spheres and inserts empty spheres,
maintaining the symmetries.

(iv) It calculates the structure matrix from the inputs of the control files.

The Preparation Module is followed by the main ASR-module. This module is
called by the routine lmasr. This main module is divided into five smaller
modules:

(i) Module A : This module reads the data generated by the Preparation
Module. It checks the inputs for consistency. The routines in this module
are again those in the Stuttgart LMTO47, but modified to read the inputs
for both constituents of the alloy.

(ii) Module B : At the start of the DFT self-consistency loop, this module 
takes the overlap of a simple Hartree atomic density calculation in the 
Preparation module and generates the Hartree and exchange-correlation 
potentials, spheridizes them and inputs them to the Atomic Module that 
follows. In later steps of the self-consistency loop, this Module first mixes 
the charge densities of the earlier steps and prepares the Hartree and 
exchange-correlation potentials, spheridizes them for input into the Atomic 
Module. 

- At this point there is a choice of using either the standard DFT exchange- 
correlation potentials or, alternatively, there is a branch module Harbola- 
Sahni which sets up the Harbola-Sahni potential for the study of excited 
states [19]. 

(iii) Module C : This is the Atomic Module. It takes the spheridized Kohn-Sham
potential generated in Module B and solves the radial Kohn-Sham equation
numerically. The Kohn-Sham orbitals and energies then lead to the potential
parameters for each constituent. Those for the two constituents are
calculated on different processors. The parameters are first calculated in
the orthogonal representation. A new routine gtoa, not present in the
Stuttgart LMTO47 package, then transforms them to the most screened
tight-binding representation.

(iv) Module D : This is the main ASR Module. The routines herein have 
been fully developed by us and form the main backbone of the package. 
The input are the tight-binding Hamiltonian parameters from the Atomic 
Module. First they are combined with the alloy composition to prepare the 
augmented space Hamiltonian. The nearest neighbour map in augmented 
space is then generated. Next, the recursion is carried out for each L 
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Figure 2: (Color Online) The TB-LMTO-Augmented Space Recursion package 
flowchart 



12 



value, terminators generated and the L projected density of states are 
calculated. Each different recursion for each L value is carried out on a 
different processor, thus vastly accelerating the calculations. 

We then proceed to calculate the total density of states and the Fermi 
energy. Again, branching out into different processors, we calculate the 
L-dependent moments and magnetic moments. Of all the Modules, this 
is the one amenable to maximum parallelization. 

- At this point we have the possibility of introducing short-ranged order. The
ASR for short-ranged order has been described in some detail earlier [20]. The
branching for this choice occurs just before we set up the augmented space
Hamiltonian. The extra input is the Warren-Cowley short-ranged order parameter.

- Also at this point we have the option of introducing disorder in the
structure matrix because of size mismatch between the two constituents of the
alloy. The branching now takes place earlier, in the Preparation Module, where
we generate not one but three different structure matrices: $S^{AA}_{ij}$,
$S^{AB}_{ij}$ and $S^{BB}_{ij}$. In the approximation of [21], Eqn. (12) is now
replaced by:

$$S = \sum_{ij} \left\{ \ll S_{ij} \gg \tilde{I} + S^{(1)}_{ij} \left( M_i + M_j \right) + S^{(2)}_{ij}\; M_i \otimes M_j \right\} \otimes T_{ij} \qquad (14)$$

where $M_i = (y - x)\, P_{\downarrow_i} + \sqrt{xy}\; T_{\uparrow\downarrow_i}$, and
$S^{(1)}_{ij}$ and $S^{(2)}_{ij}$ are combinations of the three structure
matrices, with $S^{(2)}_{ij} = S^{AA}_{ij} + S^{BB}_{ij} - 2 S^{AB}_{ij}$. This
modification will be available in this module.

It is in these last two options that the ASR really scores over the CPA, which
cannot deal with either short-ranged order or off-diagonal disorder, as both
involve more than one site. The competing methodology is that of special
quasi-random structures (SQS) [22]. However, if we are dealing with materials
with many atoms per unit cell and non-stoichiometric compositions, the SQS
required will be rather large and will involve the use of huge unit cells. Here
the TB-LMTO-ASR, which uses much smaller unit cells, will score. We have shown
earlier [23] that these two methods give virtually the same results for the
density of states.

(v) Module E : In this module the L-dependent moments are used to obtain the
charge density. We also calculate the total energy, including the Madelung
term. For the disordered alloy, the Madelung term is obtained from the
procedure suggested by Skriver and Ruban [24]. Finally the old and new moments
are mixed, and the mixed charge density thus obtained is fed back into
Module B. This is iterated till convergence in both energy and charge density
is achieved.
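The feedback between Module E and Module B is a fixed-point iteration with mixing. The text does not say which mixing scheme the package uses, so the sketch below assumes simple linear mixing with a fixed weight and monitors only the mixed quantity, whereas the package checks both energy and charge density. The toy `update` function merely stands in for one pass through Modules B to E, and all names are hypothetical.

```python
import math


def mix(old, new, weight=0.3):
    """Linear mixing: (1 - weight) * old + weight * new, element by element."""
    return [(1.0 - weight) * o + weight * n for o, n in zip(old, new)]


def self_consistency(update, start, weight=0.3, tol=1e-10, max_iter=500):
    """Iterate 'update' with linear mixing until the input stops changing.

    Returns the converged vector and the number of iterations used.
    """
    current = list(start)
    for iteration in range(1, max_iter + 1):
        new = update(current)                    # one pass of the DFT loop
        mixed = mix(current, new, weight)
        change = max(abs(m - c) for m, c in zip(mixed, current))
        current = mixed
        if change < tol:
            return current, iteration
    raise RuntimeError("self-consistency loop did not converge")


# Toy stand-in for the loop body: the fixed point of cos(q) is about 0.739085.
converged, n_iter = self_consistency(lambda q: [math.cos(v) for v in q], [0.0])
```

A smaller weight makes the loop more stable but slower; more elaborate schemes exist, but nothing in the text indicates which one is used here.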
Figure 3: (Top) DOS for Cr$_x$Fe$_{1-x}$ for different compositions: (left)
Cr$_7$Fe$_3$; (middle) Cr$_5$Fe$_5$; (right) Cr$_3$Fe$_7$. (Bottom) DOS for
Cu$_x$Zn$_{1-x}$ for different compositions: (left) Cu$_{24}$Zn$_{76}$; (middle)
Cu$_{50}$Zn$_{50}$; (right) Cu$_{75}$Zn$_{25}$. Energy (Ryd) is plotted along the
x-axis; the y-axis is the DOS (states/Ryd).



For the DFT part, our code depends heavily on the Stuttgart-LMTO routines
developed by Andersen and co-workers [25]. Two independent DFT codes run in
parallel to produce the potential parameters of the two constituent atoms of
the binary alloy. These potential parameters are used at the input point of our
ASR routines in Module D.
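The per-L parallelism of Module D relies on the fact that recursions started from different L channels are independent of each other. The sketch below only illustrates that task structure: it uses Python threads from the standard library as a stand-in for the separate processors of the actual package (threads give no real speed-up for pure Python arithmetic), and a toy chain Hamiltonian with a channel-dependent on-site energy. The names `chain_recursion` and `run_all_channels` are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def chain_recursion(onsite, n_sites=12, n_steps=5):
    """Recursion coefficients for a chain with on-site energy 'onsite'
    and unit hopping, started from the end site."""
    def apply_h(v):
        return [onsite * v[i]
                + (v[i - 1] if i > 0 else 0.0)
                + (v[i + 1] if i < n_sites - 1 else 0.0)
                for i in range(n_sites)]

    dot = lambda u, w: sum(a * b for a, b in zip(u, w))
    phi2 = [1.0] + [0.0] * (n_sites - 1)
    phi3 = [0.0] * n_sites
    beta2, alphas, betas2 = 1.0, [], []
    for _ in range(n_steps):
        phi1 = apply_h(phi2)
        norm = dot(phi2, phi2)
        alpha = dot(phi2, phi1) / norm
        phi1 = [a - alpha * b - beta2 * c for a, b, c in zip(phi1, phi2, phi3)]
        beta2 = dot(phi1, phi1) / norm
        alphas.append(alpha)
        betas2.append(beta2)
        phi3, phi2 = phi2, phi1
    return alphas, betas2


def run_all_channels(onsite_by_channel, max_workers=4):
    """Run one independent recursion per channel and collect the results."""
    channels = list(onsite_by_channel)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = pool.map(chain_recursion,
                           (onsite_by_channel[c] for c in channels))
        return dict(zip(channels, results))


coefficients = run_all_channels({"s": 0.0, "p": 0.5, "d": -0.3})
```

Since no data is exchanged between channels until the densities of states are summed, this stage scales almost ideally with the number of channels.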

Our present DFT modules deal only with collinear magnetism. For spin dependent
calculations the Hamiltonian is separable in spin space. Thus the spin is
merged with the label L, which is now $\{\ell, m, \sigma\}$, and we just have to
carry out twice the number of recursions: for $\sigma = \uparrow$ and
$\downarrow$. In case we wish to introduce spin-orbit coupling and the
possibility of non-collinear magnetism, we have to replace the DFT module with
one dealing with density matrices rather than densities [26], and the ASR
module with one applying generalized or vector recursion [12]-[13], which was
designed to deal with Hamiltonians whose representation in spin-space or
'spinor' bases is not diagonal:

$$\langle R_i L | H | R_j L' \rangle = \begin{pmatrix} H^{\uparrow\uparrow}_{R_i L, R_j L'} & H^{\uparrow\downarrow}_{R_i L, R_j L'} \\ H^{\downarrow\uparrow}_{R_i L, R_j L'} & H^{\downarrow\downarrow}_{R_i L, R_j L'} \end{pmatrix}$$

This new module is under preparation [27] and will be incorporated once the
basic checks are carried out. We mention it here in order to bring out the
different possibilities and the versatility of the package.



6 Applications and performance analysis 

We have described the contents and commented upon the efficiency and
versatility of the TB-LMTO-ASR package. We shall conclude by describing two
different applications: namely, the alloys Cu$_x$Zn$_{1-x}$, where both the
constituents are non-magnetic in the bulk and the d-band centres of Cu and Zn
are well separated; and Fe$_x$Cr$_{1-x}$, where both constituents are magnetic
and their d-bands overlap.

For a single self-consistency loop

             Old TB-LMTO-ASR    New TB-LMTO-ASR
Wall time    699 sec            225 sec
Efficiency   0.714              0.97
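The wall times quoted above translate into a speed-up factor by simple division. The short calculation below spells this out; it uses only the two wall times from the table, and the variable names are illustrative.

```python
old_wall_time = 699.0   # seconds, old TB-LMTO-ASR, one self-consistency loop
new_wall_time = 225.0   # seconds, new TB-LMTO-ASR, one self-consistency loop

speed_up = old_wall_time / new_wall_time          # ratio of wall times
time_saved_fraction = 1.0 - new_wall_time / old_wall_time

print(f"speed-up: {speed_up:.2f}x, time saved: {100 * time_saved_fraction:.0f}%")
```

The new version is thus a little over three times faster per self-consistency loop on this benchmark.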



The alloys have been studied earlier by us using the old version of the
TB-LMTO-ASR [4], [28]-[29] and the TB-LMTO-CCPA$^1$ [31], both introduced by us,
and the KKR-ICPA$^2$ [32], which is also based on the augmented space formalism,
as well as through the KKR-CPA$^3$ [34], PAW-SQS$^4$ and KKR-NL-CPA$^5$ [35].
They are therefore ideal systems for benchmarking the new TB-LMTO-ASR package.
Comparison of the densities of states shown in Fig. 3 with the results shown in
the above references shows that for Fe$_x$Cr$_{1-x}$ there is hardly anything to
choose between the various techniques and the packages based on them. However,
for the split band alloy Cu$_x$Zn$_{1-x}$ the TB-LMTO-ASR, TB-LMTO-CCPA and
KKR-NL-CPA score over the CPA versions, as expected from earlier analysis.

One of our main points of interest is the relative runtime and efficiency of
this new version of the TB-LMTO-ASR, as compared to the several earlier
versions proposed by us. To benchmark these characteristics we take a specific
calculation on Fe$_x$Cr$_{1-x}$. Each calculation is done on a 73109 site
augmented space map, with $N$ recursion steps for $L$ = ($s$, $p_x$, $e_g$ and
$t_{2g}$) and $\sigma$ = 1, 2, followed by a Beer-Pettifor termination scheme.
The old and new versions of the TB-LMTO-ASR are compared in the table above.
Here Wall time is the same as run time and Efficiency is the ratio between the
Wall time and CPU time. GNU profiles (gprof$^6$) for the new TB-LMTO-ASR and the
old serial version are given in Table 1.

$^1$ Tight-binding linear muffin-tin orbitals cluster coherent potential approximation
$^2$ Korringa-Kohn-Rostoker itinerant coherent potential approximation
$^3$ Korringa-Kohn-Rostoker coherent potential approximation
$^4$ Projector augmented wave special quasi-random structures
$^5$ Korringa-Kohn-Rostoker non-local coherent potential approximation
$^6$ http://www.cs.utah.edu/dept/old/texinfo/as/gprof_toc.html



% time          cumulative        self              calls             self          total         name
                seconds           seconds                             s/call        s/call
49.91 (61.47)   100.73 (397.33)   100.73 (397.33)   81 (128)          1.24 (3.10)   2.26 (5.01)   hop
40.78 (37.67)   183.04 (640.87)   82.31 (243.54)    1.6 (3.9) x 10^   0.00 (0.00)   0.00 (0.00)   matp
0.76 (0.79)     196.82 (646.41)   1.53 (5.11)       7 (16)            0.22 (0.00)   0.22 (0.32)   spectral
0.00 (0.00)     201.84 (646.40)   0.00 (0.01)       1.8 (3.5) x 10^   0.00 (0.00)   0.00 (0.00)   splint
0.00 (0.00)     201.84 (646.44)   0.00 (0.00)       206 (384)         0.00 (0.00)   0.00 (0.00)   spline
0.00 (0.00)     201.84 (646.44)   0.00 (0.00)       56 (56)           0.00 (0.00)   0.00 (0.00)   matmult
0.00 (0.00)     201.84 (646.44)   0.00 (0.00)       36 (36)           0.00 (0.00)   0.00 (0.00)   mom
0.00 (*)        201.84 (*)        0.00 (*)          28 (*)            0.00 (*)      6.59 (*)      doparallel
0.00 (0.00)     201.84 (646.44)   0.00 (0.00)       7 (16)            0.00 (0.00)   0.00 (0.00)   fit
0.00 (0.00)     201.84 (646.44)   0.00 (0.00)       4 (7)             0.00 (0.00)   0.00 (0.00)   tdos
0.00 (0.00)     201.84 (646.44)   0.00 (0.00)       1 (1)             0.00 (0.00)   0.00 (0.00)   band
0.00 (0.00)     201.84 (646.44)   0.00 (0.00)       1 (1)             0.00 (0.00)   0.00 (0.00)   fermi
0.00 (0.00)     201.84 (646.44)   0.00 (0.00)       1 (1)             0.00 (0.00)   0.00 (0.00)   pardos



Table 1: gprof data for the optimized and parallelized ASR code and (in
brackets) the old serial code. The discontinuity between the cumulative times
of spectral and splint is due to the machine routines, which are not included
here.



7 Conclusion 

We have presented here a computational package that combines three different
techniques to allow us to study the electronic properties of disordered
systems. The package can deal with bulk disordered alloys as well as surfaces
and interfaces, whether smooth, stepped or rough, and also with structurally
distorted lattices. The package can be generalized at many stages, which have
been clearly commented upon, and the generalizations are being carried out step
by step.